On the evolution of scale-free graphs 
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We study the evolution of random graphs where edges are added one by one between pairs of weighted ver- 
tices so that resulting graphs are scale-free with the degree exponent y. We use the branching process approach 
to obtain scaling forms for the cluster size distribution and the largest cluster size as functions of the number of 
edges L and vertices N. We find that the process of forming a spanning cluster is qualitatively different between 
the cases of y > 3 and 2 < y < 3. While for the former, a spanning cluster forms abruptly at a critical number 
of edges L c , generating a single peak in the mean cluster size (s) as a function of L, for the latter, however, the 
formation of a spanning cluster occurs in a broad range of L, generating double peaks in (s) . 

PACS numbers: 89.70.+C, 89.75.-k, 05.70Jk 
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Recently, many studies have been performed on complex 
networks. Such studies are mostly influenced by the random 
graph theory proposed by Erdos and Renyi (ER) 1 1]. In the 
ER model, N number of vertices are present from the begin- 
ning and edges are added one by one in the system, connect- 
ing pairs of vertices selected randomly. A remarkable result 
ER obtained is that a giant cluster of size o(N), a spanning 
cluster, appears abruptly when L reaches its threshold value 
L c , which is 0(N). Note that the formation of such a span- 
ning cluster can be viewed as a percolation transition occur- 
ring at the critical probability p c = 2L C /N(N - 1) = o(l/N). 
While the ER graph is pioneering, it is too random, and var- 
ious properties of the ER graph are not in accordance with 
those of complex networks recently discovered in real world. 
For example, the distribution of the number of edges incident 
on each vertex, called the degree distribution, is Poissonian 
for the ER graph, while it follows a power law for many real- 
world networks, called scale-free (SF) networks 

It was shown that a SF network can be generated by follow- 
ing a similar way to used in the ER model 01 0]- N number of 
vertices are present from the beginning and edges are added 
one by one. For SF networks, however, each vertex with the 
index of an integer i (i = 1, • • • ,7V) is not identical, but is as- 
signed a normalized weight w; = i~ a /Ylj=\ j~ a with a control 
parameter a G [0, 1). Each edge connects a pair of vertices 
selected with probability w{Wj. Thus the ER graph is 
generated with a = 0. The process of adding edges is repeated 
until the total number of edges in the system reaches L. This 
process of constructing networks is called the static model. 
When L is in the intermediate regime L# <C L <C Lu = LgN, 
with L( being specified below, the degree distribution follows 
a power law, pd(k) ~ with y = 1 + 1/a 6 (2,°°). How- 
ever, when L<Lg(L> L u ), the network is too sparse (dense), 
so that the degree distribution does not follow the SF behavior. 
The static model was introduced to generate SF networks with 
various y, being used to study various problems. However, it 
has not been studied yet how clusters evolve as the number of 
edges L increases, which is the goal of this Letter. 

The percolation problem of SF networks has been stud- 
ied reversely, that is, by removing randomly- 
selected vertices as well as their attached edges. In this Letter, 



we study how the cluster evolution of SF graphs proceeds as 
edges are added. Besides confirming the previous results in 

Ref. mm, we show that the process of forming a spanning 

cluster for the case of 2 < y < 3 is fundamentally different 
from that of y > 3. When y > 3, as in the case of the ER 
graph, there exists a critical number of edges L c at which a 
spanning cluster forms through many small clusters coalesc- 
ing, and the mean cluster size diverges at finite L c /N in the 
thermodynamic limit. In other words, a percolation transition 
occurs at L c . When 2 < y < 3, however, large or small clusters 
grow in a similar manner as a whole without sudden coales- 
cence occurring. As a result, the mean cluster size does not 
diverge anywhere, but instead exhibits two peaks at L p \ and 
L p 2- Near L p \, some small clusters merge together forming a 
much larger one, but it does not span the entire system. After 
passing L p \, the largest cluster as well as smaller ones con- 
tinue to grow, and the largest one becomes as large as O (N) 
around L p 2- Throughout this Letter, we will denote the case 
of y > 4 as (I), 3 < y < 4 as (II), and 2 < y < 3 as (III). The 
schematic diagram of the cluster formation is shown in Fig. [2 
We obtain characteristic numbers of edges for each case as 
a function of TV and summarize them in the phase diagram 
shown in Fig. |2] Moreover, we derive scaling forms for the 
cluster size distribution and the largest cluster size analytically 
and numerically. 

Power-law degree distribution — The probability Pd,i(k) 
that a vertex i has degree k follows approximately a Poisso- 
nian form as pd,i{k) — (ki) k exp(— (ki)) I k\ for large k and large 
N. Here the average degree of the vertex i is given by (ki) = 
K(L)r l ^y- l \ where K(L) = (h) = 2L/^[l/(y- 1)], with 
i^v(x) = Y/j=ij~ x - Note that i^v(.x) converges to the Riemann 
zeta function C,(x) for x > 1 but scales as N l ~ x /(1 — x) for 
x < 1. When L is small enough, most vertices have no edge. 
When (h) - l,thatis,Z,~^ = ^v[l/(y-l)] ~aKy- 2 )/(y-i), 
the SF behavior in the degree distribution begins to appear. 
When L> L(, the degree distribution is derived as 



Pd(k) 



Y,Pd,'( k ) -Ck 7 , 
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where c is given as c ~ (y— 1 /N 
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FIG. 1: Schematic picture for the comparison of cluster evolution 
between (1,11) (a) and (III) (b). 



TABLE I: Cluster size distribution P(s). Here A = (L - L c )/L c , 
and the scaling exponents (x,cx) are given as (3/2, 1/2) for (I) and 
((Y-l)/(Y-2),(Y-3)/(Y-2))for(II). 
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FIG. 2: Schematic phase diagram of the static model. The SF behav- 
ior of the degree distribution appears between L( and L u . A spanning 
cluster emerges at L c for (1,11), and around L p 2 for (III). The quanti- 
ties in [...] are only for (III). 



Branching process approach — As L increases beyond L(, 
small clusters form. However, clusters are still sparse and of 
tree structure JlJ. The formation of such sparse clusters can 
be understood through the multiplicative branching process 
approach frill . Here we introduce the probability distribu- 
tion P(s) as the number of vertices belonging to .s-clusters, 
clusters with s vertices, divided by N II 211 . Also we define 
another probability distribution R(s) as the number of edges 
followed by s-clusters divided by 2L. The generating func- 
tions of those quantities are defined as !P(z) = E s P(s)z? and 
$i (z) = Ls^(s)z s , respectively. Both summations run over fi- 
nite clusters only II ill , and then when clusters are sparse the 
following relations hold: 

^(z)=z/(X(z)) and ¥(z)=zg(!K.(z)), (2) 

where g(<0) = 1^=0 WW®* and /M = g'M/W with 
(k")=i: = ok"p d (k). 

To apply Eq. l|2} to the static model, we use the 
following form valid in the limit 1 — CO -C 1; g(co) ~ 
(l/N)EiLiexp[(jfc f )(ca - !)]■ Then it is obtained that z = 
« + L~ =1 fl„(l - co)" for K(L)(l - oo) < 1, where a i =/'(l), 
«2 a negative constant, and so on, while 
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for K(L)(1 — co) ^ 1, where L*J is the floor function of x and 
A = r(3 -y) [K(L) /N 1 Kt- 1 )]y~ z . The generating function CO = 



^.(z) can be obtained by inverting z = co//(co), and fP(z) is 
then obtained by using <2 (z) = zg{l{, (z)). 

Critical point — The values of !P(1) and ^.(1) are 1 only 
when /' (1 ) < 1 , while they are smaller than 1 when /' (1 ) > 
1. Thus the condition f(l) = 1, which is the same as the 
condition (k 2 )/(k) = 2 11311 . leads to a characteristic number 
of edges L c , 



i ^[i/(y-i)] 2 
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For large N, L c is N (y- l)(y- 3)/(2(y- 2) 2 ) for (1,11), and 
iV 2(Y-2)/(Y-i)^ y _ i) 2 /(2(y-2) 2 c; w [2/(y- 1)]) for (III). We 
will show later that for (LIT), a spanning cluster appears at 
L c , but for (III), the size of the largest cluster does not reach 
(N) at L c . Thus more edges are needed to generate a span- 
ning cluster. 

Cluster size distribution — The asymptotic behaviors of 
P(s) and R(s) for large s can be obtained from the singular 
parts of (p(z) and 3t(z) as z — » 1, respectively. In the static 
model, the characteristic behaviors of T (z) and (z) depend 
on the degree exponent y, classifying them into the three cases, 
(I), (II), and (III). Each case is again classified into (i) subcrit- 
ical (L < L c ), (ii) critical (L = L c ), and (iii) supercritical cases 
(L > L c ). Our results for P(s) are listed in Table I for each 
case. 

Emergence of spanning cluster — In cases of (i) and (ii), 
the size of the largest cluster S is obtained self-consistently 
through the relation, L^s-P^) = 1 — using P( s ) in Ta- 
ble |I] For example, when P(s) ~ s~ x , S is obtained to be 
S ~ N l l x . Thus when L = L c (ii), S ~ N 2 ^ for (I) |1| and ~ 
AT(Y-2)/(Y-i) for (II) Q. For (III), using P{s) ~ {Ns/L c ) 1 ^, 
we obtain S ~ Af(l'- 2 )/(Y- 1 ) i but we show below that this is not 
the incipient spanning cluster. The size of the largest clus- 
ter for the subcritical case (i) is S ~ msa{K(L)/\A\, \A\~ 1/a } 
for (I), - K(L)/\A\ for (II), and - K(L) for (III), with A = 
{L-L c )/L c . 

In case of (iii), the theory of the multiplicative branch- 
ing process yields the size of an infinite cluster through 
N{\ — £P (1)). Thus it can be identified with the largest clus- 
ter if it is larger than S at L c . From Eq. (0, 1 — £P (1) ~ 
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FIG. 3: Data collapse of S/N x l x versus AN 1 ^ for Eq. with y = 
3.6, andN = 10 4 (A), 10 5 (0), 10 6 (D), and 10 7 (O)- Here x = (y- 
l)/(y- 2) = 13/8, /./ = (y- l)/(y- 3) = 13/3, and L c /Af ~ 0.305 
from Eq. J4j are used. Inset: the same data plotted versus L/N cross 
at L/N ~ L C /AT. 

(2L/iV)(l-aj.(l)), and the value of 1 = 1 -CO is ob- 

tained by solving Eq. with z = 1. Thus we obtain, 5 - APjV 
with p = 1 for (I) and |3 = l/(y — 3) for (II) in the regime of 
AN l / p > 1, while S ~ (L/JV)V(3-Y)iV for (III) when A > 1, 
where a new scaling exponent /j = 3 for (I) and (y— l)/(y— 3) 
for (II) was used. The behaviors of (i), (ii), and (iii) lead to a 
scaling ansatz for (1,11), 

S~iV 1 /^ (I)II) (AiV 1 //<), (5 ) 

where x is given in Table|I] and m ( x ) ls constant for |x -c 
1 and behaves as xP for x 1 and \x | ~ 8 for x <C — 1 with 8 = 2 
for (I) and 1 for (II). In the thermodynamic limit, a spanning 
cluster emerges if only L > L c . On the other hand, for (III), 

S~N^ 2 V^V m (A), (6) 

where VPninC*) is a constant for |x| < 1, behaving as (1 +x) 
when x ~ — 1 and x'/( 3 ~Y) for x^> I. The size of the largest 
cluster is (jVfY -2 )/^ -1 )), which is not as large as (N), even 
when L>L C , so that L c = (A? 2 ^- 2 )/ (Y- 1 ) ) i s not a percolation 
threshold. The largest cluster size S becomes (N) only when 
L becomes as large as (N) . 

To check such scaling behaviors of S, we perform numeri- 
cal simulations for y= 3.6 and y = 2.4. As shown in Figs. [3] 
and H the data of S/aKy- 2 )/(y-i) with different N collapse 
with the scaling variables, AN 1 ^ for y= 3.6 (II), and A for 
y= 2.4 (III), respectively. Fory = 3.6, L c /N ~ 0.305 theoret- 
ically obtained in Eq. l|4} is confirmed by the data crossing at 
L c /N~ 0.306(2). 

Mean cluster size — The difference in the cluster evolution 
for (I), (II), and (III) appears more apparently in the mean 
cluster size (s) defined as (s) = Y l s^s s P( s )- The quantity (s) 
is similar to the susceptibility defined in the percolation theory 
but here we exclude the largest cluster even for L < L c . For 
(1,11), as L increases, many small clusters grow by attaching 
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FIG. 4: Data collapse of S/My- 2 )/^- 1 ) ver sus A for Eq. JSJ with 
y = 2.4 and N = 10 4 (A), 10 5 (0), 10 6 (D), and 10 7 (O)- Here 
Lc/N*! 1 ~ 2.08 from Eq. 0) is used. Inset: plot of S at L c ver- 
sus N for y = 2.4(o), 2.6(0), and 2.8(D), being in accordance 
with S ~ A'(Y _2 )/(Y- 1 ) represented by the dotted, dashed-dotted, and 
dashed line, respectively. 

edges, which continues up to L = L c , and then a spanning 
cluster forms by the abrupt coalescence of those small clusters 
as shown in Fig.^ Since we do not count the spanning cluster 
in calculating (s), (s) decreases rapidly as L passes L c . Thus 
the mean cluster size exhibits a peak at L = L c , which diverges 
in the thermodynamic limit N — > °°. The scaling behaviors of 
S and P(s) in Table|l]lead to another scaling ansatz 

(s) =N l ^<p(AN l ^) } (7) 

where <t>(x) is a constant when |x| -c 1, and behaves as 
whenx > 1, and Ix^ 1 for (I) and |x|-(y-3)/(y-2) f or (n) when 
x <C — 1. Such behaviors can be confirmed with numerical 
data fory = 3.6 in Fig.|5] 

For (III), however, the mean cluster size does not diverge 
at any value of L but has two blunt peaks (Fig. [6ji. First, it 
has a small peak at L — L p \, but it increases again as L in- 
creases beyond L p \. Edges newly introduced either create 
new clusters of size larger than 1 or merge small clusters to 
the larger one with size not as large as O (N). When L reaches 
L p 2 = (N) where the second peak arises, the largest clus- 
ter becomes as large as o(N). When L is near L c , (s) can 
be evaluated through (s) = Y.s^s s P( s )> an d ^ follows that 
(s) — 1 ~ min{^ 1 ',5 3_ Y}, where s c is a characteristic clus- 
ter size defined in Table U and the constant term 1 originates 
from the isolated vertices whose fraction is nearly 1 . Since S 
(s c ) increases (decreases) with increasing L for L > L c , (s) is 
maximal at S = s c , occurring at L p \ = bL c = o(A/ 2 'Y _2 )/(Y- 1 )) 
with b being a constant depending on y. That is verified nu- 
merically as shown in the inset of Fig. |6] For L p \ <CLCiV, 
using (s) = fP'(l), we obtain (s) to be 

M*l + ^£)-B(±)* + c(±)* t (8) 
x 1 3-yN ; \N J \N J 
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FIG. 5: Mean cluster size (s) as a function of L/N with y = 3.6 
for iV = 10 4 (A), 10 5 (0), 10 6 (D), and 10 7 (O)- The peak heights 
increase with N. The inset shows the data collapse of the rescaled 
mean cluster size (s) /N^^ versus AN 1 ^. 
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FIG. 6: Mean cluster size (s) as a function of L/N in semi- 
logarithmic scales with y = 2.4 for N = 10 4 (A), 10 5 (0), 10 6 (D), 
and 10 7 (O)- I n addition to the peak alL p i, another peak is shown at 
L p 2 — 0.1 N for N = 10 5 , 10 6 , and 10 7 , respectively. The measured 
values of L p \ (O) as a function of N are plotted in the inset together 
with the guide line whose slope is 2(y— 2)/(y— 1) for comparison. 
The solid line represents Eq. jSJ. 

where B = (5 - y)/(3 - y)[2((y - 2)/(y - l))?- 2 r(3 - 
y)]i/(3-7) and c = ( 2 /(3-y)[(2(y-2)/(y- l)y- 2 T{3- 
y)]2/(3-7)_ r ( 2 _y)[2(y-2)/(y-l)r(3-y)]^- 1 V(3-Tr). it ex- 
hibits a peak at L p 2/N ~ 0.1 for y = 2.4, close to the location 
obtained by numerical simulations (Fig. [5}. 

Dense graph — The power-law degree distribution 
in Eq. Q lasts up to L ~ L u = &v[l/(y - l)]iV = 
o(JV( 2 Y-3)/(y-i)^ arounc j which the vertex z = 1 is connected 
to nearly all vertices ~ N) and the degree distribution 
Pd (k) begins to develop a peak at k = N — 1 . As L in- 
creases beyond L u , more vertices have such maximal number 
of edges, and the graph becomes denser. 



Summary — We have studied how clusters of SF graphs 
are created and evolve as the number of edges increases. 
We obtained the cluster size distribution, the largest cluster 
size, and the mean cluster size as functions of the numbers of 
edges L and vertices N. Those quantities behave differently 
when y > 3 and 2 < y < 3. For the former, a giant spanning 
cluster forms through a sudden coalescence of small clusters, 
exhibiting a percolation transition, while for the latter, it does 
gradually, and the mean cluster size shows double peaks at 
distinct numbers of edges, L p \ and L p 2- This result implies 
that the fragmentation process of SF graphs under random 
failures on edges is qualitatively similar to (different from) 
the one under intentional attack when y>3(2<y<3)|2llSll- 
Finally, it is noteworthy that recently Aiello elal. 1 15] studied 
the possibility of forming a spanning cluster for given N and 
L = {N/2%(y— l)/£(y) as a function of y, and found that a 
spanning cluster can exist only for y < y c . ~ 3.48. However, 
the way of constructing a SF graph in their model is different 
from ours. 
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